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(54)Titie: DETECTING TUMOURS 
(57) Abstract 

A method of determining whether a human patient has 
a tumour or whether a tumour has metastasised comprising 
the steps of (1) obtaining a sample of tissue from the patient, 
the said tissue being one that does not normally contain 
a cytokeratin 20 (CK20) gene product and (2) determining 
whether a cytokeratin 20 (CiC20) gene product is present in 
said tissue sample. Once it has been deteimined whether a 
human patient has a tumour or is likely to develop metastatic 
disease from such a tumour, the physician can decide on -an 
appropriate course of treatment, 



244bp- 




M 100 10 



r>0 



PO 



ii 



214bp- 




M 100 10 1 100 10 



1 w 



ng 



PO 




4S0OCID: <WO 9617080A1 J_> 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify Steles party to the PCT on the front pages of pamphlets publishing intematicmal 
applications under the PCT. 



AT 


Aniria 


GB 


Unitad Kmgdom 


MR 


Mauritania 


AU 


Aumlift 


GE 


Geoisia 


MW 


MaUwi 


BB 


Bubiilos 


GN 


Guinea 


NE 


Niger 

Netterlands 


BE 


BdgitMB 


GR 


Gneoe 


NL 


BF 


Buifcioa Piso 


m 


HuDgaiy 


NO 


Norway 


BG 


Bulgirii 


a 


IieUBd 


NZ 


New Zealand 


BJ 




IT 


baly 


PL 


l^oland 


BR 


Brazil 


JP 


iapaa 


FT 


taiugal 


BY 


Belanit 


KE 


Keaya 


RO 


Romania 


CA 


CaiiMla 


KG 


Kyifyttas 


UV 


Russian Federatioo 


CF 


Goml Africao Republic 


KP 


Democratic Podple't Republic 


SD 


Sudan 


CG 


Coqgo 




of Korea 


SE 


Sweden 


CH 


SwinerUukd 


KR 


Republic of Korea 


SI 


Slovenia 


Ci 


COied'lvoUe 


KZ 


KazakfaBiao 


5K 


Slovakia 


CM 


Cimcrooo 


U 


LaedaeuieiD 


SN 


Senegal 


CN 


Cbiu 


uc 


Sri Lanka 


TD 


Chad 


cs 


Czecfaotlovaku 


LU 


LuaemlKMis 


TG 


Togo 


cz 


Cndi RfpwMir 


LV 


Latvia 


TJ 


Tajikittan 


DE 


Ocmaay 


MC 


Mooaoo 


TT 


Trinidad and Tobago 


DK 


Deomttt 


MD 


Republic of Moldova 


UA 


Ukiaine 


BS 


Span 


MG 


Madagascar 


US 


United States of America 


n 


FiDlttid 


ML 


MaU 


UZ 


Uxbekhian 


FR 


Ffaacc 


MN 


MoQfolia 


VN 


V'm Nam 


CA 


GabOB 











4SDC3CID: <WO_9617080A1.L> 



wo 96/17080 PCT/GB9SA>2734 

DETErTTNG TUMOim.Q 

The present invention relates to methods of detecting tumours including 
metastatic disease in a patient, particularly metastatic disease of epithelial 
5 cell tumour origin, more particularly disseminating colon carcinoma. 

The development and growth of malignant tumours or cancers commonly 
results in the release of some of the cancerous cells from the developing 
tumour into the blood or other body fluids. These cells are then 

10 transported to other parts of the body where they may become implanted 
and set up secondary tumours or metastases, thus leading to a general 
dissemination or spread of the original cancer that is responsible for 
production of the primary tumour. The process of metastasis may 
commence at quite an early stage in the development and growth of the 

15 primary tumour, and in fact it is metastases, frequently haematogenous 
metastases produced by cancer tumour cells circulating in the peripheral 
blood, that determine the outcome of the disease for most patients. 



It is imporunt to detect such cancer cells in body fluids, particularly 
20 peripheral blood, as this may aid in diagnosing the original cancer and 
monitoring the disease. In particular, detection of such cancer cells in the 
peripheral blood is an indicator of the likelihood of metastatic disease and 
is useful to the physician in deciding upon a suitable course of treatment. 

25 The number of such cancer or tumour cells circulating in such body fluids, 
particularly peripheral blood, is generally very small and they cannot 
therefore be distinguished and readily detected by routine microscopy. 
Techniques for their detection therefore need to be highly sensitive but 
must remain speciflc. 

30 
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Because of the huge number of blood cells compared with potential cancer 
or tumour cells it is very important that any marker that is used to detect 
the said cancer or tumour cells is not found in the normal blood cells. 



5 Various methods have been described previously in an attempt to develop 
methods of detecting metastatic disease by analysing blood and bone 
marrow samples for cancer cells. Moss & Sanders (1990) J. Clin. Oncol. 
8, 736-740 describes the detection of neuroblastoma cells in blood using 
monoclonal antibodies reactive against unidentified neuroblastoma cell 
10 antigens. 



Sawyers et al (1990) Proc. Natl Acad. ScL USA 87, 563-567 discloses a 
method of detecting chronic myelogenous leukaemia (CML) cells in the 
blood of a patient using the polymerase chain reaction (PGR), In this case 
15 cellular mRNA was extracted from blood samples and cDNA made using 
reverse transcriptase (RT). Thus, a RT-PCR was used to amplify cDNA 
corresponding to mRNA transcribed from the abnormal gene found in 
CML. 



20 GB 2 260 811 A discloses a general method for the diagnosis or 
monitoring of cancer of a malignant tumour using a RT-PCR. In this 
case, melanoma cells were detected in blood of a patient using a RT-PCR. 
Specific oligonucleotide primers were used to amplify tyrosinase mRNA. 
Tyrosinase is not expressed in normal blood but is expressed at a 

25 relatively high level in some melanoma cells (which are derived from 
melanocytes); however, analysis of tyrosinase gene expression is 
complicated by the presence of tyrosinase-related protein genes and the 
method is limited because some melanomas lack tyrosinase expression 
(amelanotic tumours). 
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Burchill et al (1994) Int. J. Cancer 51, 671-675 describes neuroblastoma 
cell detection by RT-PCR for tyrosine hydroxylase mRNA. 



It will be appreciated that melanoma, neuroblastoma and CML are 
5 tumours arising from highly specialised cell types and, in the case of 
CML, arising from a cell containing chromosomal abnormality. Some of 
the most prevalent cancers arise from less specialised epithelial cells, for 
example breast cancer and colon carcinoma. Clearly, any method of 
detecting cells in the blood derived from such epithelial cell tumours must 
10 rely on the marker to be detected not being expressed in normal blood. 

Cytokeratins (CKs) are components of mammalian cell cytoskeleton and 
constitute a multigene family of proteins (see Nagel (1988) Am. J, Surg. 
Pathol 12, (suppl. 1), 4-16) and Moll et al (1982) Cell 31, 11-24 for 
15 reviews). CKs are expressed predominantly in epithelial cells where they 
show strict lineage and differentiation-associated patterns of expression. 
Malignant cells generally retain the CKs of their progenitor cell type and 
have been used previously to characterise neoplastic cells of epithelial 
origin. 

20 

Traweek et al (1993) Am. J. Pathol. 142, 1111-1118 have used RT-PCR 
to detect the expression of CK8, CK18 and CKI9 in various tissues. CK8 
and CK18 are expressed in all tissue that has been studied including 
peripheral blood mononuclear cells, aspirated bone marrow cells and 
25 lymph nodes. Some of these cells are components of normal blood and 
so CK8 and CK18 are expressed in normal blood and, consequently, are 
of no use in analysing blood samples for the presence of cells derived 
from an epithelial cell tumour. 

30 Although Traweek et al apparently did not detect CK19 gene activity in 
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normal peripheral blood and bone marrow, they found that the CK19 
activity in stromal cells caused problems and that CK19 expression was 
detected in lymph nodes. Datta et al (1994) J. Clin. Oncol 12, 475-482 
have used RT-PCR to analyse the expression of CK19 in breast cancer 
S patients and, apparently, no gene expression was detected in the blood of 
patients at stages I, II or III but only ten of twenty-six patients with 
metastatic disease gave a positive result using the RT-PCR test. However, 
a low frequency of CK19 expression was found in normal blood indicating 
illegitimate transcription of CK19 mRNA or the presence of CK19- 
10 expressing cells in normal blood or bone marrow. Furthermore, as 
discussed in more detail in the Examples of the present invention, we have 
found CK19 expression using RT-PCR in six out of fifteen control blood 
samples. 

IS Thus, CK19, like CK8 and CK18, is not a suitable target for detection of 
tumour cells in peripheral blood. 

Surprisingly, we have found that CK20 is not expressed in normal blood 
samples. CK20 is expressed, for example, in a high proportion of 

20 colorectal carcinomas, stomach cancers, mucinous ovarian 
adenocarcinomas and transitional cell carcinomas. Thus, it is an object of 
the present invention to provide methods for detecting these and other 
tumours and, in particular, metastatic disease, by detecting the presence 
of CK20 gene expression in the blood or bone marrow or other suitable 

25 tissue of a patient. 

One asp}ect of the present invention provides a method of determining 
whether a human patient has a tumour or whether a tumour has 
metastasised comprising the steps of (1) obtaining a sample of tissue from 
30 the patient, the said tissue being one that does not normally contain a 
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cytokeratin 20 (CK20) gene product and (2) determining whether a 
cytokeratin 20 (CK20) gene product is present in said tissue sample. 

By "determining whether a tumour has metastasised"* we include 
5 determining whether any tumour cells are found at a site remote from a 
primary tumour whether or not those tumour cells are found in a further 
tumour. 



Thus, the method includes determining the likelihood that a tumour has 
10 spread or is spreading or will spread in a patient. 

It will be appreciated that, in essence, the invention comprises determining 
whether a tissue sample from a patient contains a cytokeratin 20 (CK20) 
gene product, particularly a tissue sample which does not normally contain 
15 a cytokeratin 20 (CK20) gene product. 

By "CK20 gene product" we include CK20 mRNA and CK20 protein. 

It will be appreciated by a person skilled in the art that the human CK20 
20 gene, as is the case for many human genes, is polymorphic and therefore 
that many allelic forms occur. Thus, by "CK20 gene" we include all 
allelic forms. Different allelic forms can be readily detected by comparing 
one CK20 gene or mRNA sequence with another, for example by DNA 
sequencing or restriction fragment length polymorphism (RFLP) analysis. 

25 

We have found unexpectedly that a CK20 gene product is not present in 
blood or bone marrow from a human who does not suffer from a tumour 
or, particularly, in whom a tumour has not metastasised. Thus, it is 
preferred that the tissue sample is blood or bone marrow. Blood is readily 
30 obtained from the patient using venepuncture whereas bone marrow is 
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obtained using standard aspiration techniques (for example by placing a 
needle into the iliac bone and aspirating). Purged bone marrow is 
suitable. Alternatively, lymph nodes, peripheral stem cell harvests, urine 
and cystectomy samples may be used. Thus, we include urine in the term 
5 "tissue sample". A suitable sample also includes polymorphonuclear cells 
such as neutrophils. 

When we state that a CK20 gene product is not present in blood or bone 
marrow from a human who does not suffer from a tumour we mean that 

10 the amount of any CK20 gene product is so low that it cannot be detected, 
at least by presently available techniques. The amount of CK20 gene 
product present in the tissue (such as blood) of a human patient who is 
determined by the method of the invention to have CK20 gene product 
present in said tissue, compared to a human who does not have a CK20 

15 gene product (as defined), is at least two-fold higher, preferably at least 
10-fold higher, more preferably at least 100-fold higher and most 
preferably at least 1000-fold higher. 

It is preferred if the tumour is an epithelial cell tumour. 

20 

There are many epithelial cell-derived tumours including breast carcinoma 
colorectal carcinoma, stomach adenocarcinoma, mucinous ovarian 
adenocarcinoma, all bladder carcinoma including dysplasia and transitional 
cell carcinoma. The vast majority of mucinous ovarian and colorectal 
25 adenocarcinomas express CK20; greater than 75% of stomach and gall 
bladder adenocarcinomas contain CK20-expressing cells; and more than 
half of pancreatic adenocarcinomas contain CK20-expressing cells. 

It is less preferred if the tumour is a lung tumour. 

30 
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A high proportion of Merkel cell carcinomas and transitional cell 
carcinomas express CK20. 



Thus, the method of the invention is particularly suited to determine 
5 whether a patient has, or is likely to develop metastatic disease from, 
mucinous ovarian and colorectal adenocarcinomas. 

Determination of colorectal adenocarcinoma and metastasis therefrom are 
particularly preferred. 

10 

It is preferred that the patient has not suffered a local trauma or has not 
undergone surgery both of which may result in the release of epithelial 
cells into the blood. 

15 Presence of a cytokeratin 20 gene product can be determined in a tissue 
sample using various techniques. Conveniently, it is determined by 
detecting CK20 messenger RNA (mRNA). Although, in principle, CK20 
mRNA can be detected in a tissue sample by hybridising a specific nucleic 
acid probe to the mRNA (such as an antisense RNA or antisense 

20 oligonucleotide, the said probe being labelled with a readily-detectable 
moiety for example, a fluorescent dye or a radionuclide), the sensitivity 
of such a method may not allow the detection of the very small amount of 
CK20 mRNA that is present in a tissue sample. For example, a 5 ml 
blood sample may contain only tens of CK20-expressing cells derived 

25 from the epithelial cell tumour but tens of thousands of cells in total. 
Thus, it is preferred that the mRNA is detected by, first, making a 
complementary DNA (cDNA) copy of the CK20 mRNA and, second, 
amplifying the cDNA using DNA amplification methods. 



30 



In a particularly preferred embodiment the following steps are used: 



8 

A tissue sample is obtained from the patient. Conveniently. 2 ml 
of blood is removed from the patient and frozen at -80^C. 

Total cellular RNA is extracted from the blood sample, for example 
using standard techniques described in Sambrook et al (1989) 
Molecular Biology, a laboratory manual. Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, USA, Alternatively, 
commercially available kits can be used such as Ultraspec™ RNA 
(a trade mark) available from Biogenesis, Bournemouth. UK. 

The mRNA is converted into cDNA using suitable oligonucleotide 
primers, deoxynudeotides and an enzyme with reverse transcriptase 
activity. The oligonucleotide for making cDNA can be, for 
example, either: 

a) An oligo-dT primer which hybridises to the polyA tail of 
mRNA; 

b) Random short sequences, such as random hexamers. which 
prime cDNA synthesis of the total RNA; or 

c) Oligonucleotides specific for CK20 which prime cDNA 
synthesis from CK20 mRNA. 

Optionally any residual chromosomal DNA is removed using a 
nuclease such as DNase I. 

OligonucleoUde primers are selected which direct specific 
amplification of the CK20 cDNA. A PCR is performed using these 
oligonucleotides, a thermal-stable DNA polymerase and 
deoxynudeotides. 
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6. Optionally, further oligonucleotides are selected which direct 
specific amplification of a DNA fragment chosen irorn the inter- 
primer region defined in step S and a further PCR (a nested" 
PCR) is carried out. 

5 

7. The product of the PCR reaction of step S or 6 is detected using 
agarose gel electrophoresis and ethidium bromide staining of the 
DNA. 

10 Nucleotide sequences of CK20 cDNAs are shown in Figures 5 to 7. The 
cDNA and gene sequence described in Moll et al (1993) Differentiation 
53, 75-93 is incorporated herein by reference as are the sequences and 
information from the relevant database submissions described in the 
legends to Figures 5 to 7. The approximate position of the introns (in the 

IS gene) are marked. It is preferred if oligonucleotides suitable for DNA 
amplification comprise a sequence selected from the sequence shown in 
any one of Figures 5 to 7 such that the first oligonucleotide is capable of 
hybridizing to one exon and the second oligonucleotide is capable of 
hybridizing to another exon. Preferably the oligonucleotide primers are 

20 between 10 and SO nucleotides in length, more preferably between 14 and 
30 nucleotide long, most preferably around 20 nucleotides long. 

It is preferred if the oligonucleotide primers can hybridize to all alleles of 
the CK20 gene. Regions of the CK20 gene and cDNAs common to all 
2S alleles can be determined by comparing the sequences of CK20 genes and 
cDNAs such as those shown in Figures S to 7. 



Preferred oligonucleotide primers comprise or consist of the sequence S'- 
CAGACACACGGTGAACTATGG-3' (SEQ ID No 1) and S'- 
30 GATCAGCTTCCACTGTTAGACG-3' (SEQ ID No 2). Further preferred 
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oligonucleotide primer pairs are (all listed 5' -* 3'): 

CTCCTGGAATCTCCAATGG (SEQ ID No 3) and 
GCATnTGCAGTTGAGCATCC (SEQ ID No 4); 

5 

CTCCAATGGAnrCAGTCG (SEQ ID No 5) and 
AATITGCAGGACACACCGAGC (SEQ ID No 6); 

CTAAATGACCGTCTAGCGAGC (SEQ ID No 7) and 
10 TCCACATTGACAGTGTTGCCC (SEQ ID No 8); 

CCAACTCCAAACTTGAAGTGC (SEQ ID No 9) and 
TCCACATTGACAGTGTTGCCC (SEQ ID No 10); and 

15 TGGGCAACACTGTCAATGTGG (SEQ ID No 11) and 
TCCATGTTACTCCGAATCTGC (SEQ ID No 12). 

It is well known that the sequence at the 5' end of the oligonucleotide 
need not match the target sequence to be amplified. 

20 

It is usual that the PCR primers do not contain any complementary 
structures with each other longer than 2 bases, especially at their 3' ends, 
as this feature may promote the formation of an artifactual product called 
"primer dimer". When the 3' ends of the two primers hybridize, they 
25 form a "^primed template" complex, and primer extension results in a short 
duplex product called "primer dimer". 



30 



Internal secondary structure should be avoided in primers. For symmetric 
PCR, a 40-60% G+C content is often recommended for both primers, 
with no long stretches of any one base. The classical melting temperature 



wo 96/17080 ^^CT/GB9S/02734 

11 

calculations used in conjunction with DNA probe hybridization studies 
often predict that a given primer should anneal at a specific temperature 
or that the 72*C extension temperature will dissociate the primer/template 
hybrid prematurely. In practice, the hybrids are more effective in the 
5 PCR process than generally predicted by simple T„ calculations. 

Optimum annealing temperatures may be determined empirically and may 
be higher than predicted. Tag DNA polymerase does have activity in the 
37-55 "C region, so primer extension will occur during the annealing step 
10 and the hybrid will be stabilized. The concentrations of the primers are 
equal in conventional (symmetric) PCR and, typically, within 0.1- to 1- 
fiM range. 

As an alternative to detecting the product of DNA amplification using 
15 agarose gel electrophoresis and ethidium bromide staining of the DNA, it 
is convenient to use a labelled oligonucleotide capable of hybridising to the 
amplified DNA as a probe. When the amplification is by a PCR the 
oligonucleotide probe hybridises to the interprimer sequence as defined by 
the two primers. The oligonucleotide probe is preferably between 10 and 
20 50 nucleotides long, more preferably between 15 and 30 nucleotides long. 
The probe may be labelled with a radionuclide such as '^P.^'P and "S 
using standard techniques, or may be labelled with a fluorescent dye. 
When the oligonucleotide probe is fluorescently labelled, the amplified 
DNA product may be detected in solution (see for example Balaguer et al 
25 (1991) ''Quantification of DNA sequences obtained by polymerase chain 
reaction using a bioluminescence adsorbent" Anal. Biochem. 195, 105-1 10 
and Dilesare et al (1993) "A high-sensitivity electrochemiluminescence- 
based detection system for automated PCR product quantitation" 
BioTechniques 15, 152-157. 

30 
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Any of the DNA amplification protocols can be used in the method of the 
invention including the polymerase chain reaction, QB replicase and ligase 
chain reaction. The polymerase chain reaction is particularly preferred 
because of its simplicity. 

5 

The polymerase chain reaction may, if desired, be carried out in situ, or 
at least in whole tissue samples isolated from the patient, using the 
methods described in Komminoth et al (1994) Pathol. Res. Pract. 190(1), 
1017-1025, incorporated herein by reference; and Komminoth et al (1994) 
10 Verhandlungers der Deutschen Gesellschaft jur Pathologic 78, 146-152, 
incorporated herein by reference. 

In principle it is possible to detect the presence of a CK20 gene product 
by determining whether the sample contains any CK20 protein. This is 

15 most conveniently achieved using antibodies that react specifically with 
CK20. For example, monoclonal antibody K, 20.8 reactive against CK20 
can be purchased from Cymbus Bioscience Limited, Southampton, UK or 
other suitable monoclonal antibodies can be made using methods well 
known in the art. For example, suitable monoclonal antibodies to CK20 

20 may be prepared using the techniques described in Monoclonal Antibodies: 
a manual of techniques, H. Zola (CRC Press. 1988) and in Monoclonal 
Hybridoma Antibodies: Techniques and Applications, J.G.R. Hurrell 
(CRC Press. 1982). 

25 Conveniently, the antibody is labelled with a readily detectable marker 
such as a radionuclide or fluorescent dye. Suitable radionuclides include 
'*'Tc, '"I and '^P. Suitable fluorescent dyes include fluorescein. 

Once it has been determined, according to the methods of the invention 
30 whether a human patient has a tumour or is likely to develop metastatic 
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disease from such a tumour, the physician can decide on an appropriate 
course of treatment. Other methods may supplement the present method 
in order to reach a diagnosis. 

5 A further aspect of the invention provides a kit of parts comprising 
oligonucleotide primers capable of amplifying a cytokeratin 20 (CK20) 
cDNA, deoxynucleotides and a DNA polymerase. 

The kit may further comprise means for extracting RNA suitable primers 
10 for making CK20 cDNA, an enzyme with reverse transcriptase activity, 
and means for detecting an amplified DNA product. 

The invention will now be described in more detail with reference to the 
following Figures and Examples in which: 

15 

Figure 1 shows the products of RT-PCR for CK8, 19 and 20 mRNA on 
1 pg to 100 ng of total mRNA isolated from RTl 12 (CK8), MCF7 (CK19) 
and HT29 (CK20) cell lines. A single band of 244, 214 and 370 bp 
respectively was identified after separation of products in an agarose gel 
20 and staining with ethidium bromide. There was an increase in the 
intensity of this band with increasing RNA concentration. 

M = molecular weight markers, W = water control. 

25 Figure 2 shows the products of RT-PCR for CK8(i). 19(ii) and 20(iii) 
mRNA separated by agarose gel electrophoresis and stained with ethidium 
bromide in 6 control bloods (1-6). 

C = positive control for CK8, 19 or 20 mRNA detection. W = water 
30 negative control. M = molecular weight markers. 
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Figure 3 shows the products of RT-PCR for CK20 mRNA separated by 
electrophoresis and stained with ethidium bromide in 6 control bone 
marrow samples (i). RT-PCR for GAP-DH in the same RNA samples 
Cii). 

5 

C — positive control of RNA extracted from HT29 cells. W = water 
negative control. M » molecular weight markers. 

Figure 4 shows the products of RT-PCR for CK20 mRNA separated by 
10 agarose gel electrophoresis and stained with ethidium bromide in blood 
samples spiked with 1 to 10^ HT29 cells. A single 370 bp band was 
identified when as few as 100 cells per ml of whole blood were analysed 
(3). No band was identified in unspiked blood (O). RT negative samples 
showed no amplified band (ii). 

15 

+C = positive control of RNA extracted from HT29 cells. W = water 
negative control. M molecular weight markers. 

Figure 5 shows the nucleotide sequence of part of a human CK20 mRNA 
20 with the amino acid sequence of the encoded polypeptide composed of the 
exon sequence taken from Accession X73501, (bases 1 to 18061). 

Author: Zimbelmann, R; Title: Direct Submission; Journal: Submitted 
(14 Sep 1993) to the EMBL/GenBank/DDBJ databases. R. Zimbelmann, 
25 German Cancer Research Center, Division of Cell Biology, Im 
Neuenheimer Feld 280, 69254 Heidelberg, FRO. 

Intron positions are shown with a II. 
30 Figure 6 shows the nucleotide sequence of part of a human CK20 mRNA 
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with the amino acid sequence of the encoded polypeptide composed of the 
exon sequence taken from as in 

Authors: Moll, R., Zimbelmann, R., Goldschmidt, M.D., Keith, M., 
5 Laufere, J., Kasper, M.. Koch, P.J. and Franke, W.W; Title: The 
human gene encoding cytokeratin 20 and its expression during fetal 
development and in gastrointestinal carcinomas; Journal: Differendation 
53(2), 75-93 (1993). 

10 Intron positions are shown with a II. 

Figure 7 shows the nucleotide sequence of part of a human CK20 mRNA 
taken from Accession X73502 

15 Reference: 1 (bases 1 to 1461); Authors: Calnek. D. and Quaroni, A.; 
Title: Differential localization by in situ hybridization of distinct keratin 
mRNA species during intestinal epithelial cell development and 
differentiation; Journal: Differentiation 53(2), 95-104 (1993); Medline: 
93366035. 

20 

Reference: 2 (bases 1 to 1461); Author: Quaroni, A.; Title: Direct 
Submission; Journal: Submitted (07 Sep 1993) to the 
EMBL/GenBank/DDBJ databases. A. Quaroni, Cornell University, 724 
A Vet. Res. Tower, Section of Physiology, Cornell University, Ithaca, 
25 NY 14853, USA. 
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Example 1; PctCCtion of CK20-exnressing enhhglial t^lU in 
Materials and mpthnrfc 

5 

Cell Lines. The three well-characterised human cell lines used in the 
study were the transitional cell carcinoma-derived RT112 cell line, the 
breast adenocarcinoma MCF-7 cell line and the colonic adenocarcinoma 
HT29 cell line. MCF-7 and HT29 cells express CK8, CK18 and CK19 

10 (Moll et al (1982) Cell 31, 1 1-24). whereas RTl 12 express CK8, CK18, 
CK19 along with other CK isotypes characteristic of bladder epithelial 
cells (Wu et al (1982) Cell 31. 693-703). CK20. the most recenUy 
identified CK isotype, is expressed by HT29 cells, but not by MCF-7 cells 
(Moll et al (1992) Am. J. Pathol. 140, 427-447). All cell lines were 

15 maintained in a 1:1 mixture of DMEM and RPMl 1640 media 
supplemented with 5% foetal bovine serum and passaged using 0.25% 
trypsin in versene (0.02% EDTA). 

Blood and bone marrow samples. Normal blood or bone marrow 
20 samples were obtained from volunteers aged between 18 to 45 years. 
Samples were taken into EDTA, aliquoted into 2 ml volumes and frozen 
at -80'C until required for RNA extraction. 

RNA extraction. Total cellular RNA was extracted from cell lines, 
25 normal whole blood, normal bone marrow or spiked normal blood using 
Ultraspec™ RNA (Biogenesis, Bournemouth, UK) according to the 
manufacturer's instructions. 

Reverse transcriptase-polymerase chain reaction (RT-PCR). The RT- 
30 PCR method used was based on that for the detection of neuroblastoma 
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cells (Burchill et al (1994) Int. J. Cancer 57, 671-675). Briefly, dilution 
curves of RNA were DNase treated and reverse transcribed to produce 
cDNA using a random hexamer primer. RT products were amplified by 
PCR for CK8, CK19 or CK20 (primer sequences are given in Table 1). 
5 RT-PCR products were analysed by agarose gel electrophoresis and 
ethidium bromide staining. Reverse transmptase negative controls (RT- 
ve) in which reverse transcriptase enzyme was omitted were included for 
all RT-PCR reactions. Water negative controls (W) contained all 
components for the RT-PCR reaction but no targeted RNA. Where 
10 appropriate, positive controls ( + C) of RNA exu-acted from HT29, MCF7 
or RT112 cells were included. Molecular weight markers (^X 174 RF 
DNA/Hae HI, Gibco BRL, Paisley, Scotland or 123 bp ladder, Pharmacia, 
Milton Keynes, UK) were included on all agarose gels. 



15 The quality of RNA was confirmed by amplification of cDNA for GAP- 
DH or 18S probed Northern blot analysis. All primers were purchased 
from Oswell DNA Services (Edinburgh, Scotland). 



Table 1. Primer .sequences used for PCR amplincation nf CK8. CK19 
20 and CK20 





Sense primer 


Antlsense primer 


|CK8 


AACAACCTTAGGCGGCAGCT 

(SEQ ID No 13) 


GCCTGAGGAAGTTGATCTCG 
(SEQ ID No 14) 


lcK19 


GCGGGACAAGATTCTTGGTG 

(SEQ ID No 15) 


CTTCAGGCCTTCGATCTGCAT 

(SEQ ID No 16) 


CK20 


CAGACACACGGTGAACTATGG 

(SEQ ID No 1) 


GATCAGCTTCCACTGTTAGACG 

(SEQ ID No 2) 



Primer sequences for PCR were selected using the Dieffenbach Selection 
Programme. Primers were located within different exons and were either 
20, 21 or 22 mers. 
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Specificity of RT-PCR. RT-PCR products were separated on agarose 
gels and Southern blotted onto nylon membrane (Hybond N^, Amersham, 
UK). Filters were hybridised with a gamma end-labelled 
oligonucleotide probe, the sequence of which lay between each primer set. 
5 The nucleotide sequence of RT-PCR products was confirmed by dideoxy 
chain termination sequencing (Sequenase, USS, Canada). 

Cell spiking* Cell spiking experiments were used to test the potential 
sensitivity of this technique for detection of colon carcinoma cells in 
10 blood. Known numbers of HT29 cells were added to whole blood 
samples, mRNA extracted and RT-PCR for CK20 performed. To 2 ml 
aliquots of whole blood 10 to 1 x 10^ cells were added; an unspiked blood 
sample was included in each experiment. RNA (100 pg) from HT29 cells 
was included as a positive control. 

15 

Results 

RT-PCR detection of bladder, breast and colon carcinoma cells. RT- 
PCR for CK8 generated a single 244 bp band identified on ethidium 
20 bromide stained agarose gels (Fig l,i). This fragment was confirmed by 
Southern blot analysis and sequencing (data not shown) to be the fragment 
of CK8 which lies between the two primers used for PGR. The band was 
detected in 10 pg-100 ng of total RNA from RT112 cells. 

25 RT-PCR for CK19 generated a single 214 bp band identified on ethidium 
bromide stained agarose gels (Fig l,ii) which was confirmed to be CK19 
by Southern blot analysis and sequencing (data not shown). This band 
was detected in 100 pg-1 ng of total RNA from MCF7 cells. 

30 RT-PCR for CK20 generated a single band of 370 bp (Fig l,iii). This 
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band was confirmed by Southern hybridisation and sequence analysis (data 
not shown) and detected in 100 pg of total RNA firom HT29 cells. 

In all three cases there was an increase in band intensity with increasing 
5 amounts of RNA (Fig 1). No transcripts were identified in water control 
samples (Fig 1) or RT negative samples (results not shown). 

Control blood and bone marrow analysis. In 8/9 and 6/15 control blood 
.samples analysed CK8 and CK19 RT-PCR products were identified under 

10 the described conditions. Southern blotting confirmed amplified bands 
were CK8 and CK19 RT-PCR products. In 15/15 control blood samples 
analysed, CK20 was undetectable by ethidium bromide staining or 
Southern blot hybridisation. RT-PCR results for CK8(i), CK19(ii) or 
CK20(iii) arc shown in Fig 2 for six control bloods, 6/6 were positive for 

15 CK8, 3/6 for CK19 and 0/6 for CK20. RT-PCR for CK20 in 6/6 normal 
bone marrow samples showed no amplified bands (Fig 3,i). The integrity 
of bone marrow RNA samples was confirmed by RT-PCR for GAP-DH 
(Fig 3,ii). 

20 Cell spiking. In HT29 cell spiking experiments it was possible to detect 
down to 100 HT29 cells diluted in 2 ml of whole human blood (Fig 4,i). 
The 370 bp band generated was shown by Southern blotting to hybridise 
to a '^P end-labelled oligonucleotide probe specific for CK20 and 
confirmed by sequence analysis to be that of CK20 (results not shown). 

25 No RT-PCR products were detected in whole blood alone (Fig 4,i 0). 
RT-PCR products were not identified in reverse transcriptase negative 
samples (Fig 4,ii). 

Since CK8 and CKI9 were expressed in a high proportion of normal 
30 peripheral blood samples (88% and 40% respectively) neither would be 
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suitable targets for detection of tumour cells in peripheral blood. 

CK20 mRNA was not detected in any normal blood or bone marrow 
samples examined suggesting it to be the CK of choice for detection of 
5 carcinomas of epithelial origin. CK20 has been detected in almost all 
cases of colorectal adenocarcinomas by immunohistochemistry and is a 
useful target for the detection of disseminating colon carcinoma by RT- 
PCR. 

10 Example 2; Determining whether a patient has an epithgiial cell 

immuu: 

S ml of blood is removed from the patient. RNA is produced from the 
blood as described in Example I and a RT-PCR is carried out using the 
15 CK20-specific primers as described in Example 1. If a CK20-specific 
DNA product is amplified it suggests that the patient may have an 
epithelial cell tumour. Further confirmatory tests may be performed in 
order to reach a diagnosis. 

20 Example 3 : Determining whether a patient with colorectal carcinoma 
tumours is likelv to develop metastases 

S ml of blood is removed from the patient. RNA is produced from the 
blood as described in Example 1 and a RT-PCR is carried out using the 
25 CK20-specific primers as described in Example 1 . If a CK20-speciiic 
DNA product is amplified it suggests that the patient is likely to develop 
metastatic disease disseminated from the colorectal tumour. Further 
confirmatory tests may be performed in order to reach a diagnosis. 
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CLAIMS 

1 . A method of determining whether a human patient has a tumour or 
whether a tumour has metastasised comprising the steps of (1) obtaining 
5 a sample of tissue from the patient, the said tissue being one that does not 
normally contain a cytokeratin 20 (CK20) gene product and (2) 
determining whether a cytokeratin 20 (CK20) gene product is present in 
said tissue sample. 

10 2. A method according to Claim 1 wherein the tumour is an epithelial 
cell tumour. 



3. A method according to Claims 1 or 2 wherein the tissue is blood 
or bone marrow. 

15 

4. A method according to any one of Claims 1 to 3 wherein the 
epithelial cell tumour is any one of colorectal adenocarcinoma, stomach 
adenocarcinoma, mucinous ovarian adenocarcinoma, bladder carcinoma, 
gall bladder adenocarcinoma and transitional cell carcinoma. 

20 

5. A method according to Claim 4 wherein the epithelial cell tumour 
is colorectal adenocarcinoma. 

6. A method according to any one of the preceding claims wherein the 
25 presence of the cytokeratin 20 (CK20) gene product is determined by 

detecting cytokeratin 20 (CK20) messenger RNA (mRNA). 

7. A method according to Claim 6 wherein the cytokeratin 20 
messenger RNA (mRNA) is copied into complementary DNA (cDNA). 

30 
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8. A method according to Claim 7 wherein the complementary DNA 
(cDNA) is amplified. 

9. A method according to Claim 8 wherein the complementary DNA 
5 (cDNA) is amplified using the polymerase chain reaction (PCR). 

10. A mediod according to any one of Claims 6 to 9 wherein genomic 
DNA is removed from or cleaved in the said sample prior to detecting 
cytokeratin 20 (CK20) messenger RNA (mRNA). 

0 

11. A method according to Claim 9 or 10 wherein the primers used in 
the polymerase chain reaction (PCR) comprise or consist of the sequences 
5'-CAGACACACGGTGAACTATGG-3' (SEQ ID No 1) and 5'- 
GATCAGCTTCCACTGTTAGACG-3' (SEQ ID No 2). 

5 

12. A kit of parts comprising oligonucleotide primers for amplifying a 
cytokeratin 20 (CK20) cDNA. deoxynucleotides and a DNA polymerase. 
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T 

GTGAGCAGAG GAGGAGTTTC TTGCCTGTGG ACTTCATAAA AGCCTAGCTC 

AACACCCTCC ATGAGACACA CTCTGCCCCA ACCATCCTGA AGCTACAGGT 

GCTCCCTCCT GGAATCTCCA ATOGATTTCA GTCGCAGAAG CTTCCACAGA 

MDFSRR SFHR 

AGCCTGAGCT CCTCCTTGCA GGCCCCTGTA GTCAGTACAG TGGGCATGCA 
SLSS SLQ APVVSTVGMQ 

GCGCCTCGGG ACGACACCCA GCGTTTATGG GGGTGCTGGA GGCCGGGGCA 
RLG TTPS VYGGAGGRGI 

TCCGCATCTC CAACTCCAGA CACACGGTGA ACTATGGGAG CGATCTCACA 
Rl SNSR HTVN YGS DLT 

GGCGGCGGGG ACCTGnTGT TGGCAATGAG AAAATGGCCA TGCAGAACCT 
GGGD LFVGNE KMAMQNL 

AAATGACCGT CTAGCGAGCT ACCTAGAAAA GGTGCGGACC CTGGAGCAGT 
NDR LASYLEK VRT LEQS 

CCAACTCCAA ACTTGAAGTG CAAATCAAGC AGTGGTACGA AACCAACGCC 
NSK LEVQIKQWYE TNA 

CCGAGGGCTG CTCGCGACTA CAGTGCATAT TACAGACAAA TTGAAGAGCT 
PRAG RDYSAY YRQ I EEL 

GCGAAGTCAGIIATTAAGGATG CTCAACTGCA AAATGCTCGG TGTGTCCTGC 
RSQ IKDAQLQNARCVLQ 

AAATTGATAA TGCTAAACTG GCTGCTGAGG ACTTCAGACT GAA||GTATGAG 
IDN AK LAAEDFRLK YE 

ACTGAGAGAG GAATACGTCT AACAGTGGAA GCTGATCTCC AAGGCCTGAA 
TERG IRLTVEADLQGLN 

TAAGGTCnr GATGACCTAA CCCTACATAA AACAGATTTG GAGATTCAAA 
KVF DDLT LHK TDLEI Ql 

TTGAAGAACT GAATAAAGAC CTAGCTCTCC TCAAAAAGGA GCATCAGGAGfl 
EEL NKD L ALL KKE HQE 



GAAGTCGATG GCCTACACAA GCATCTGGGC AACACTGTCA ATGTGGAGGT 
EVDG LHKHLGNTVNVEV 

TGATGCTGCT CCAGGCCTGA ACCTTGGCGT CATCATGAAT GAAATGAGGC 
DAA PGLN LGVIMN EMRQ 

AGAAGTATGA AGTCATGGCC CAGAAGAACC TTCAAGAGGC CAAAGAACAG 
KYEVMAQKNLQEAKEQ 



Figure 5 (sheet 1 of 2) 
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902 TTTG AGAGAC AG||ACTGCAGT TCTGCAGCAA CAGGTCACAG TGAATACTGA 
FERQ TAVLQQQVTVNTE 

952 AGAATTAAAA GGAACTCAGG TTCAACTAAC GGAGCTGAGA CGCACCTCCC 
ELKGTEV QLTELRRTSO 



1002 



ACAGCCTTGAGATAGAACTC CAGTCCCATC TCAGCATG||AA AGACTCriTG 
IE LQSHL SMK ESL 



1052 GAGCACACTC TAGAGGAGAC CAAGGCCCGT TACAGCAGCC ACTTAGCCAA 
EHTLEETKARYSSQLAN 

1 102 CCTCCAGTCG CTGTrGAGCT CTCTGGAGGC CCAACTGATG CAGATTCGGA 
LQSLLSS LEA QLMQIRS 

1 152 GT AACATGGA ACGCCAGAAC AACGAATACC ATATCCTTCT TGACATAAAG 
NME RQNNEYHILLDI K 

1202 ACTCGACTTG AACAGGAAAT TGCTACTTAC CGCCGCCTTC TGGAAGGAGA 
TRLE QE I ATY RRLL EGE 

1252 AGACGT AAA||A ACTACAGAAT ATCAGTTAAG CACCCTGGAA GAGAGAGIIATA 
DVK TTEYQLSTLEERDI 

1302 TAAAGAAAAC CAGGAAGATT AAGACAGTCG TGCAAGAACT AGTGGATGGC 
KKTRKI KTVV QEVVDG 

1352 AAGGTCGTGT CATCTGAAGT CAAAGAGGTG GAAGAAAATA TCfTAAIIATAGC 
KVVS SEV KEVEENI • 

1402 TACCAGAAGG AGATGCTGCT GAGGTnTGA AAGAAATTTG GCTATAATCT 

1452 TATCnrCCT CCCTGCAAGA AATCAGCCAT AAGAAAGCAC TATTAATACT 

1502 CTGCAGTGAT TAGAAGGGGT GGGGTGGCGG GAATCCTATT TATCAGACTC 

1552 TGTAATTGAA TATAAATGTT TTACTCAGAG GAGCTGCAAA TTGCCTGCAA 

1602 AAATGAAATC CAGTGAGCAC TAGAATATTT AAAACATCAT TACTGCCATC 

1652 TITATCATGA AGCACATCAA TTACAAGCTG TAGACCACCT AATATCAATT 

1702 TGTAGGTAAT GTTCCTGAAA ATTGCAATAC A TTCAATTA TACTAAACCT 

1752 CACAAAGTAG AGGAATCCAT GTAAATTGCA AATAAA 

Figure 5 (sheet 2 of 2) 



SUBSTITUTE SHEET (RULE 26} 



.0617080A1_L> 



wo 96/17080 PCT/GB95ra2734 

7/9 



1 T 

2 GTCAGCAGAG CAGGACTTTC TTGCCTGTGG ACTTCATAAA AGGCTAGCTC 

52 AACACCCTCC ATGAGACACA CTCTGCCCCA ACCATCCTGA AGCTACAGGT 

102 GCTCCCTCCTGGAATCTCCA ATOGATTTCA GTCGCAGAAG CTTCCACAGA 

MDFSRR SFHR 

152 AGCCTGAGCT CCTCCTTGCA GGCCCCTGTA GTCAGTACAG TGGGCATGCA 
SLSS SLQ APVVSTVGMQ 

202 GCCCCTCGGG ACGACACCCA GCGTTTATGG GGGTGCTGGA GGCCGGGGCA 
RLG TT PS VYGGAGGRG I 

252 TCCCCATCTC CAACTCCAGA CACACGGTGA ACTATGGGAG CGATCTCACA 
SNSRHTVN YGS DLT 

302 GGCGGCGGGG ACCTGTTTGT TGGCAATGAG AAAATGGCCA TGCAGAACCT 
G GG D LF V GNE KM AM QNL 

352 AAATGACCGT CTAGCGAGCT ACCTAGAAAA GGTGCGGACC CTGGAGCAGT 
NDR LASYLEK VRT LEQS 

402 CCAACTCCAA ACTTGAAGTG CAAATCAAGC AGTGGTACGA AACCAACCGC 
NSK LEVQIKQWYE TNR 

452 CCGAGGGCTG GTCGCGACTA CAGTGCATAT TACAGACAAA TTGAAGAGCT 
FRAG RDYSAY YRQI EEL 

502 GCGAAGTCAG||ATTAAGGAtG CTCAACTGCA AAATGCTCGG TGTGTCCTGC 
RSQ IKDAQLQNARCVLQ 

552 AAATTGATAA TGCTAAACTG GCTGCTGAGG ACTTCAGACT GAA||GTATGAG 
IDN AK LAAE DFRLK YE 

602 actgagagag gaatacgtct aacagtggaa gctgatctcc aaggcctgaa 
terg irltveadlqgln 

652 taaggtcnr gatgacctaa ccctacataa aacagatttg gagattcaaa 
kvf ddlt lhk tdlei qi 

702 TTGAAGAACT GAATAAAGAC CTAGCTCTCC TCAAAAAGGA GCATCAGGAGil 
EEL NKDLALL KKE HQE 



752 GAAGTCGATG GCCTACACAA GCATCTGGGC AACACTGTCA ATGTGGAGGT 
EVDG LHKHLG NTVN VEV 

802 TGATGCTGCT CCAGGCCTGA ACCTTGGCGT CATCATGAAT GAAATGAGGC 
DAA PGLN LGVIMN EMRQ 

Figure 6 (sheet 1 of 2) 
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AGAAGTATGA AGTCATGGCC CAGAAGAACC TTCAAGAGGC CAAAGAACAG 
KY E VMA Q KN L QEA K E Q 

TTTGAGAGAC AG|I ACTGCAGT TCTGCAGCAA CAGGTCACAG TGAATACTGA 
FERQ TAVLQQQVTVNTE 



1102 
1152 
1202 

1252 



AGAATTAAAAGGAACTGAGG TTCAACTAACGGAGCTGAGACGCACCTCCC 

E lkgtev qlt elrrts q 
1002 agagccitga gatagaactc cagtcccatc tcagcatgIIaa AGAGTCnrO 

^ ^ ^ IE LQSHL SMK ESL 

1052 GAGCACACTC TAGACGAGAC CAAGGCCCGT TACAGCAGCC AGTTAGCCAA 
fcHTLEETKARYSSQLAN 

CCTCCAGTCG CTGTTGAGCT CTCTGGAGGC CCAACTGATG CAGATTCGGA 
LQSLLS S LEA QLMQIRS 

GTAACATGGA ACGCCAGAAC AACGAATACC ATATCOTCT T^^^^^^ 

ACTCGACTTG AACAGG AAAT TGCTACTTAC CGCCCCCTTC TGGAAGGAGA 
TRLE QEIATYRRLLEGE 

AGACGTAAAllA ACTACAGAAT ATCAGTTAAG CACCCTGGAA GAGAGAGIIaTA 
o V K TTEYQLSTLE ERD i 

1302 TA^GAAAAC CAGCAAGATT AAGACAGTCG TGCAAGAAGTACTGGATGGC 
•^J^TRKl KTVV QEVVDG 

1352 AAGGTCGTGT CATCTCAAGT CAAAGAGGTG GAAGAAAATA TqrAA||ATAGC 
KVVS SEV KEVEENI • 

1402 TACCAGAAGG AGATGCTGCT GAGGmTGA AAGAAATTTG GCTATAATCT 
1452 TATCriTGCr CCCTGCAAGA AATCAGCCAT AAGAAAGCAC TATTAATACT 

1502 CTGCAGTGAT TAGAAGGGGT GGGGTGGCGG GAATCCTATT TATCAGACTC 

1552 TGTAATTGAA TATAAATGTT TTACTCAGAG GAGCTGCAAA TTGCCTGCAA 

1602 AAATGAAATC CAGTGAGCAC TAGAATATTT AAAACATCAT TACTGCCATC 

1652 TTTATCATGA AGCACATCAA TTACAAGCTG TAGACCACCT AATATCAATT 

1702 TGT AGGTAAT GTTCCTGAAA ATTGCAATAC A TTCAATTA TACTAAACCT 

1752 CACAAAGTAGAGGAATCCATGTAAATTGCAAATAAA 

Figure 6 (sheet 2 of 2) 
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li GGACCCTGGA GCAGTCCAAC TCCAAACTTG AAGTGCAAAT 

10? ^t^m"" TACGAAACCA ACGCCCCGAG GGCTGGTCGC GACTAci^G 

101 CATATTACAG ACAAATTGAA GAGCTGCGAA GTCAGATTAA rrrS^™;? 

151 CTGCAAAATG CTCGGTGTGT CCTGCAAATT GaJJJt^ fJSJ^SS^ 

Iti agactgaagt atgagSS?^ gagJSg^S 

^ni tctccaaggc ctgaataagg tctttgatga cctS^Sot? 

301 CATAAAACAG ATTTGGAGAT TCAAATTGAA GAACTGAATA aIAJ^SS™ 
TCTCCTCAAA AAGGAGCATC AGGAgSSct cSJJggJS^A ^S^SJJS 
TGGGCAACAC TGTCAATGTG GAGGTTGATG CTGCTCCAcr r^S-T^Si^ 
??fSJSi^^^ TGAATGAAAT GAGgSg^G S?SJgJcA tSc^SI 

gaggccaaag aacagtttga gagacagact gcag55J5^ 
SS?'^^^^'^ actgaagaat taaaaggaac tSa?^t5^ 
ctaacggagc tgagacgcac ctcccagagc CTTGAGATAG AACTCCAGTC 

CCCg5?aCAG cla^riil CACTCtJgAG ^ScS^gS 

CCCGTTACAG CAGCCAGTTA GCCAACCTCC AGTCGCTGTT GAGCTCrrrr 
GAGGCCCAAC TGATGCAGAT TCGGAGTAAC ATGGAACGCr rrf fJflSJ? 
ATACCATATC CTTCTTGACA TAAAGA^JJ^ A^^^^cfc SSJJ^JSctJ 
CTTACCGCCG CCTTCTGGAA GGAGAAGACG TAAAAACTAC AGaJSSSg 

IJJtJSSJS?'' tggaagagag agatataaag aaaaccacca ag^^Sac 

,nn, GAAGTAGTGG ATGGCAAGGT CGTGTCATCT GAAGT^S 

SSST??^^^ AAATATCTAA ATAGCTACAG AAGGAGATGC 5^0^^ 
1051 TTGAAAGAAA TTTGGCTATA ATCTTATCTT TGCTCCCTGC Aa?I?ASSI^ 
1101 CCATAAGAAA GCACTATTAA TAC???GCAa ^GOTA^G ^G^S^^S? 
mi GCCGGAATCC TATTTATCAG ACTCTGTAAT TGAATATA^ 5g???5aS? 
lll^ CAAATTGCCT GCAAAAATGA AATC^JIJJJ GcISIgS? 

1251 ATTTAAAACA TCATTACTGC CATCTTTATC ATGAAGCACA TrAA?T^?Jrr 
llli t^nlfS!^'''' ACCTAATATC AATTTGTAGG Ji^TC^^CCT gSJJSgJJ 
III, ^rnf^T^^^ ATTATACTAA ACCTCACAAA GTAGAGGAAT CCM^SSS^ 
itli. ISS^i^ CCACTTTCTA ATTTTTAAAA AAAAAAAAAA 



351 

401 

451 

501 

551 

601 

651 

701 

751 

801 

851 

901 

951 



/translation^-EKVRTLEQSNSKLEVQIKQWYETNAPRAGRDYSAYYRQIEELRS 

QIKDAQLQNARCVLQIDNAKLAAEDFRLKYESERGIRLTVEADLQGLNKVFDDLTLHK 

TDLEIQIEELNKDLALLKKEHQEEVDGLHKHLGNTVNVEVDAAPGLNLGVIMNEMRQK 

YEVMAQKNLQEAKEQFERQTAVLQQQVTVNTEELKGTEVQLTELRRTSQSLEIELQSH 

LSMKESLEHTLEETKARYSSQLANLQSLLSSLEAQLMQIRSNMERPNNEYHILLDIKT 

RLEQEIATYRRLLEGEDVKTTEYQLSTLEERDIKKTTKIKTWQEWDGKWSSEVKE 
VEENI" 



Figure 7 
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